Gaining phonetic knowledge whilst improving synthetic speech quality?
نویسندگان
چکیده
منابع مشابه
High-quality speech synthesis for phonetic speech segmentation
This paper presents an original technique for solving the phonetic segmentation problem. It is based on the use of a speech synthesizer for the alignment of a text on its corresponding speech signal. A high-quality digital speech synthesizer is used to create a synthetic reference speech pattern used in the alignment process. This approach has the great advantage on other approaches that no tra...
متن کاملLearning effects for phonetic properties of synthetic speech
We address the question of what is learned while listening to synthetic speech produced by means of diphone-based synthesis. In standard diphone-based speech synthesis, the diphone database contains a single token for each phoneme transition. Learning may occur at different levels: listeners may learn the mapping between acoustic properties of particular diphones and their phonemic labelling; o...
متن کاملImproving consistence of phonetic transcription for text-to-speech
Grapheme-to-phoneme conversion is an important step in speech segmentation and synthesis. Many approaches are proposed in the literature to perform appropriate transcriptions: CART, FST, HMM, etc. In this paper we propose the use of an automatic algorithm that uses the transformation-based errordriven learning to match the phonetic transcription with the speaker’s dialect and style. Different t...
متن کاملMethods for Integrating Phonetic and Phonological Knowledge in Speech Inversion
Exploiting the information about the vocal tract shape that produced the speech has been appealing to speech researchers and scientists for a long period of time. Experimental studies that included the articulatory information from physiological measurements supported the idea that this information could be useful in a number of areas of speech science and technology. However, the estimation of...
متن کاملPhonetic Classification on Wide-Band and Telephone Quality Speech
Benchmarking the performance for telephone-network-based speech recognition systems is hampered by two factors: lack of standardized databases for telephone network speech, and insufficient understanding of the impact of the telephone network on recognition systems. The N-TIMIT database was used in the experiments described in this paper in order to "calibrate" the effect of the telephone netwo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Phonetics
سال: 1991
ISSN: 0095-4470
DOI: 10.1016/s0095-4470(19)30308-0